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5» Cln Cly His Gly Thr Cys He Asp Cly He Cly Ser Phe Set Cys Asp Cys Arg Ser Cly Trp Clu Gly Arg Phe Cys Gin Arc 

C CAA GGG CAC CGC ACG TCC ATC GAC GGC ATC CCC ACC TTC ACC TGC GAC TCC CCC AGC CCC TCC GAG CCC CCC TTC TCC CAG CCC 

10 20 30 40 50 6 0 70 80 

100 110 120 

Clu Val Ser Phe Leu Asn Cys Ser Leu Asp Asn Gly Gly Cys Thr His Tyr Cys Leu Clu Glu Val Gly Trp Arg Arg Cys Ser Cys Ala 

CAG CTC AGC TTC CTC AAT TGC TCT CTC GAC AAC GGC GGC TCC ACG CAT TAC TCC CTA GAG GAG GTC CCC TCC CGC CGC TCT ACC TCT CCC 

90 100 110 120 130 140 150 160 170 

130 140 150 

Pro Cly Tyr Lys Leu Cly Asp Asp Leu Leu Gin Cys His Pro Ala Val Lys Phe Pro Cys Gly Arg Pro Trp Lys Arg Met Glu Lys Lys 

CCT CCC TAC AAC CTC GGG GAC GAC CTC CTC CAC TCT CAC CCC CCA GTC AAC TTC CCT TCT GGG AGC CCC TCC AAC CCC ATC GAC AAC AAC 

180 190 200 210 220 230 240 250 260 



eu^Lvs Aro"Asi 



160 



1170 



180 



Arg Ser His Leu^Lys Arg^Asp Thr Clu Asp Cln Clu Asp Gin Val Asp Pro Arg'Leu He Asp Gly Lys'htet Thr Arg Arg Gly Asp Ser 
CCC ACT CAC CTC AAA CGA GAC ACA CAA GAC CAA CAA CAC CAA CTA CAT CCC CCC CTC ATT CAT GGG AAC ATC ACC ACG CGC GGA GAC ACC 
270 280 290 300 310 320 330 340 350 

190 200 210 

Pro Trp Cln Val Val Leu Leu Asp Ser Lys Lys Lys Leu Ala Cys Cly Ala Val Leu He His Pro Ser Trp Val Leu Thr Ala Ala His 

CCC TGC CAG CTC GTC CTC CTC CAC TCA AAC AAC AAC CTC CCC TGC CCC CCA CTC CTC ATC CAC CCC TCC TCC CTC CTC ACA CCC CCC CAC 

360 370 380 390 400 410 420 430 440 

220 230 240 

Cys Met Asp Clu Ser Lys Lys Leu Leu Val Arg Leu Gly Glu Tyr Asp Leu Arg Arg Trp Glu Lys Trp Glu Leu Asp Leu Asp He Lys 
TGC ATG GAC GAG TCC AAG AAG CTC CTT GTC AGG CTT GGA GAG TAT CAC CTC CGC CGC TGC CAG AAC TGC GAG CTC CAC CTC GAC ATC AAG 
450 - 460 470 480 490 500 510 520 530 

+ 250 260 270 

Clu Val Phe Val His Pro Asn Tyr Ser Lys Ser- Thr Thr Asp Asn Asp He Ala Leu Leu His Leu Ala Gin Pro Ala Thr Leu Ser Gin 
GAC CTC TTC GTC CAC CCC AAC TAC ACC AAG AGC ACC ACC GAC AAT GAC ATC CCA CTC CTC CAC CTG CCC CAG CCC CCC ACC CTC TCC CAG 
540 550 560 570 580 590 600 610 620 

280 290 300 

Thr He Val Pro He Cys Leu Pro Asp Ser Cly Leu Ala Clu Arg Glu Leu Asn Gin Ala Gly Gin Glu Thr Leu Val Thr Cly Trp Gly 

ACC ATA CTG CCC ATC TGC CTC CCC. CAC ACC CCC CTT CCA GAG CCC GAC CTC AAT CAG CCC CGC CAG GAC ACC CTC CTG ACG GGC TCC GGC 
630 640 650 660 670 680 690 70*0 710 

310 + 320 + 330 

Tyr His Ser Ser Arg Glu Lys Glu Ala Lys Arg Asn Arg Thr Phe Val Leu Asn Phe He Lys He Pro Val Val Pro His Asn Clu Cys 
TAC CAC AGC ACC CGA GAG AAC GAG GCC AAC ACA AAC CGC ACC TTC CTC CTC AAC TTC ATC AAC ATT CCC CTG CTC CCC CAC AAT CAC TGC 
720 730 740 750 760 770 780 790 BOO 

340 350 360 

Ser Glu Val Met Ser Asn Met Val Ser Clu Asn Met Leu Cys Ala Cly He Leu Gly Asp Arg Gin Asp Ala Cys Glu Gly Asp Ser Gly 
ACC CAC CTC ATC AGC AAC ATC CTC TCT GAC AAC ATC CTG TCT GCC CGC ATC CTC CCC CAC CGC CAC CAT CCC TCC CAC GGC CAC ACT GCC 
810 820 830 840 850 860 870 880 890 

370 380 390 

Cly Pro Met Val Ala Ser Phe His Cly Thr Trp Phe Leu Val Gly Leu Val Ser Trp Cly Clu Cly Cys Gly Leu Leu His Asn Tyr Cly 

CCC CCC ATC GTC CCC TCC TTC CAC GCC ACC TGC TTC CTG CTG GGC CTC CTC ACC TCC CCT GAG CGC TCT CGC CTC CTT CAC AAC TAC GCC 

900 910 920 930 940 950 960 970 980 

400 410 419 

Val Tyr Thr Lys Val Ser Arg Tyr Leu Asp Trp He His Gly His He Arg Asp Lys Clu Ala Pro Gin Lys Ser Trp Ala Pro STOP 

CTT TAC ACC AAA GTC ACC CCC TAC CTC CAC TGC ATC CAT GGG CAC ATC ACA CAC AAG CAA CCC CCC CAG AAG AGC TCC CCA CCT TAC CCA 

990 1000 1010 1020 1030 1040 1050 ' 1060 1070 



CCC TCC CTC CAC GGC TGC GCT TTT CCA TGC CAA TCC ATG GGA cj\T TAA a| GG CAC ATG TAA CAA CCA CAC CCC CCT CCT CTT CTC TCC TTC 
1080 1090 1100 1110 1120 1130 1140 1150 1160 



CAT CCC TCT TTT GGC CTC TTC TCC ACC CAA CTA ACA TTT ACT GAC CAC CTC TTC TAT CTC ACA TGC CTT ATC AAT AGA ATC TTA ACT CCT 
1170 1180 1190 1200 1210 1220 1230 1240 1250 



AGA CCA ACT CTG TCC GGT GGC CAC GAC CAC ATC CAA CTT TTC CGC CCT CTA AAC CTC TCT GTC TTG AGC GGG ATA CTC TCT TTA TGA AAA 
1260 1270 1280 1290 1300 1310 1320 1330 1340 



A Ck ATA A/| a AAC ACA ACC ACG AAA AAA AAA 
1350 1360 1370 3 

Fig. 2. Nucleotide sequence of the cDNA inserts in XHC1026 and XHC1375 that code for human protein C. The predicted amino acid 
sequence is shown starting with residue 64 in the light chain. This numbering assumes that the length of the light chain of bovine and human 
protein C is identical. ♦ , Potential carbohydrate binding sites to asparagine residues; i , apparent cleavage sites for removal of the connecting 
dipeptide; I , site of cleavage in the heavy chain when protein C is converted to activated protein C; • . site of polyadenylylation in AHC1026. 
The processing or polyadenylylation sequences are shown in boxes. 



pears to be one of the few serine proteases that has a leucine 
rather than an isoleucine or valine in this position. 

The nucleotide sequences near the active-site serine for 
protein C and three other human vitamin K-dependent serine 
protease cDNAs (25, 34-36) are shown in Fig. 4. DNA and 
amino acid sequence homology in this region is highly con- 



served. The cDNAs isolated in this investigation lack the 5' 
end. This region codes for approximately 63 additional ami- 
no acids present in the light chain and a leader sequence that 
is typical of secreted proteins (37). It should be of interest to 
compare the DNA sequence corresponding to this region of 
protein C with factor IX and factor X, since these three pro- 



